Search for: All records
Total Resources: 3
Free, publicly-accessible full text available April 24, 2026
Wong, Lionel; Mao, Jiayuan; Sharma, Pratyusha; Siegel, Zachary; Feng, Jiahai; Korneev, Noa; Tenenbaum, Joshua B; Andreas, Jacob (International Conference on Learning Representations)
Effective planning in the real world requires not only world knowledge, but the ability to leverage that knowledge to build the right representation of the task at hand. Decades of hierarchical planning techniques have used domain-specific temporal action abstractions to support efficient and accurate planning, almost always relying on human priors and domain knowledge to decompose hard tasks into smaller subproblems appropriate for a goal or set of goals. This paper describes Ada (Action Domain Acquisition), a framework for automatically constructing task-specific planning representations using task-general background knowledge from language models (LMs). Starting with a general-purpose hierarchical planner and a low-level goal-conditioned policy, Ada interactively learns a library of planner-compatible high-level action abstractions and low-level controllers adapted to a particular domain of planning tasks. On two language-guided interactive planning benchmarks (Mini Minecraft and ALFRED Household Tasks), Ada strongly outperforms other approaches that use LMs for sequential decision-making, offering more accurate plans and better generalization to complex tasks.

Galanti, Tomer; Siegel, Zachary; Gupte, Aparna; Poggio, Tomaso (Center for Brains, Minds and Machines (CBMM))
In this paper, we study the bias of Stochastic Gradient Descent (SGD) to learn low-rank weight matrices when training deep ReLU neural networks. Our results show that training neural networks with mini-batch SGD and weight decay causes a bias towards rank minimization over the weight matrices. Specifically, we show, both theoretically and empirically, that this bias is more pronounced when using smaller batch sizes, higher learning rates, or increased weight decay. Additionally, we predict and observe empirically that weight decay is necessary to achieve this bias. Finally, we empirically investigate the connection between this bias and generalization, finding that it has a marginal effect on generalization. Our analysis is based on a minimal set of assumptions and applies to neural networks of any width or depth, including those with residual connections and convolutional layers.